# 大模型技术栈-实战与应用
- 训练框架
    - deepspeed
    - megatron-lm
    - colossal-ai
    - trlx
- 推理框架
    - triton
    - vllm
    - text-generation-inference
    - lit-llama
    - lightllm
    - TensorRT-LLM(原FasterTransformer)
    - fastllm
    - inferllm
    - llama-cpp
    - openPPL-LLM
- 压缩框架
    - bitsandbytes
    - auto-gptq
    - deepspeed
- embedding框架
    - sentence-transformer
    - FlagEmbedding
- 向量数据库 [向量数据库对比]("https://www.jianshu.com/p/43cc19426113")
    - faiss
    - pgvector
    - milvus
    - pinecone
    - weaviate
    - LanceDB
    - Chroma
- 应用框架
    - Auto-GPT
    - langchain
    - llama-index
    - quivr
- python前端
    - streamlit
    - gradio
- python API工具
    - FastAPI+uvicorn
    - flask
    - Django